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This paper illustrates asymptotic properties for a response-adaptive 
design generated by a two-color, randomly reinforced urn model. The 
design considered is optimal in the sense that it assigns patients to 
the best treatment, with probability converging to one. An approach 
to show the joint asymptotic normality of the estimators of the mean 
responses to the treatments is provided in spite of the fact that allo- 
cation proportions converge to zero and one. Results on the rate of 
convergence of the number of patients assigned to each treatment are 
also obtained. Finally, we study the asymptotic behavior of a suitable 
test statistic. 

1. Introduction. The present paper is devoted to studying asymptotic 
properties of sequential, response-adaptive designs generated by a two-color, 
generalized Polya urn that is reinforced every time it is sampled with a ran- 
dom number of balls that are the same color as the ball that was extracted. 
The model is based on the two-color, randomly reinforced urn studied in 
[22] and [18], and for conciseness we will denote it by the acronym RRU. 
An RRU generalizes to discrete or continuous responses the urn models 
proposed initially in [9] and [17] for dichotomous responses, and which were 
also applied to select an optimal dosage in [8]. This work has been stim- 
ulated by the fact that a design driven by an RRU allocates units to the 
best response with a probability that converges to one. In the context of 
a response-adaptive design used to allocate patients in a clinical trial, this 
property is desirable from an ethical point of view; for this reason, the results 
obtained in this work will be illustrated within a clinical trial framework. 
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However, the reader should note that response- adaptive designs are of fun- 
damental importance in many areas of applications, for instance in industrial 
problems. 

The experimenter has two simultaneous goals: collecting evidence to de- 
termine the superior treatment, and biasing the allocations toward the better 
treatment in order to reduce the proportion of subjects in the experiment 
that receive the inferior treatment. Patients enter the experiment sequen- 
tially and are allocated randomly to a treatment, according to a rule that 
depends on the previous allocations and the previous observed responses. 
A vast number of adaptive designs have been proposed in recent years; in- 
formed reviews are found in [28] and in [13]. Many of them are based on 
generalized urn models; see, for instance, [27], which traces an historical de- 
velopment of generalized urn models, their properties and applications in 
sequential designs. Among more recent literature on urn models and adap- 
tive designs, there are, for instance, [3, 4, 16, 30]. Notwithstanding their 
generality, their assumptions don't cover the RRU. A major reason is that 
the mean replacement matrix of an RRU is not irreducible. Although in 
the past many response-adaptive designs have been focused on binary re- 
sponses, more attention recently has been given to continuous outcomes. 
Among others, we note the procedures proposed in [2, 14, 29]. 

Consider a clinical trial conducted to compare two competing treatments, 
say B and W, and a response-adaptive design. Indicate, with Nb{ti) and 
N\y (n) , the number of patients allocated through the nth patient to treat- 
ment B and W, respectively. Many of these designs allocate patients tar- 
geting a certain proportion p 6 (0,1); the proportions of patients Nb{ji)/ti 
and Nw(n)/n allocated to each treatment converge almost surely to p and 
1 — p, respectively, where p may be ad hoc or may be determined by some 
optimality criteria, which are usually a function of the unknown parameters 
of the outcomes. The adaptive design considered in this paper is different, 
because its optimality property is to assign patients to the best treatment 
with a proportion that converges almost surely to 1, while the proportion 
of patients allocated to the inferior treatment converges to 0. When the two 
treatments are equivalent, the design allocates the proportion of patients 
with a random limit in [0, 1] . 

After the specification of the model and the provision of preliminary re- 
sults in Section 2, the first part of this work, included in Section 3, is dedi- 
cated to the study of the exact rate of convergence to infinity of the sample 
sizes Nb(ti) and N\y(n). Because the treatment given corresponds to the 
color of the ball that is drawn, Nb(u) and Nw(n) also correspond to the 
number of balls of each color sampled from an RRU through the nth draw. 
Moreover, we obtain the order of convergence of the process representing 
the proportion of black balls contained in the urn at every stage. 
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The asymptotic properties of response-adaptive designs studied in liter- 
ature are usually based on the hypothesis that the target allocation p is a 
determined value in (0, 1). In some of those procedures, despite the random- 
ness of the number of patients Nb(ti) and N\y(n) allocated to treatments B 
and W, respectively, and the complex dependence structure of the random 
variables involved, it has been proved that joint normality of the estimators 
of the mean responses based on the observed responses still holds. An im- 
portant contribution is given in [21], where the authors provide a general 
method to prove consistency and an easy, general, nonmartingale approach 
to prove asymptotic normality of estimators based on adaptively observed 
allocations. As the authors show, their method can be applied to a wide 
class of adaptive designs targeting an allocation proportion p € (0,1). Al- 
though their basic framework covers the adaptive design considered in this 
paper and strong consistency of the adaptive estimators of the parameters 
involved is derived from it, their method can't work for proving asymptotic 
normality when p is exactly equal to one or p may have a random behavior 
as in the RRU procedure. Therefore, in Section 4, we prove that the joint 
asymptotic normality of the estimators of the mean responses still holds, 
both when the two treatments are equivalent and when one treatment is 
superior. The argument used resorts to a martingale technique that involves 
the concept of mixing convergence; mixing convergence is required to obtain 
the distribution of the two-sample t-statistic. 

In Section 5, we consider the following hypothesis test: the experimenter 
wants to test the null hypothesis that the two treatments are equivalent, 
in the sense that the mean responses ps and pw are equal, against the 
alternative hypothesis that ps > Pw- F° r this reason, we are interested in 
the distribution of the usual test statistic Co for comparing the difference 
of the means, based on the observed responses, both under the null and 
the alternative hypothesis. When the target allocation is a value p in (0, 1), 
from the joint normality of the estimators of the means, it can be deduced 
using Slutsky's theorem that the test statistic Co still has an asymptotic 
normal distribution. Also, under the alternative hypothesis, Co has the same 
asymptotic noncentrality parameter as in the classical case, in which the 
sample sizes hb and nw are deterministic. For a review of the approach to 
these situations, see [12, 13, 29]. 

In the response-adaptive procedure considered in this paper, since the 
limit allocation is a random variable under the null hypothesis, we can't 
apply directly Slutsky's Theorem to derive the asymptotic normality of the 
test statistic Co from the joint normality result of estimators. Notwithstand- 
ing, it is proved that under the null hypothesis the asymptotic normality of 
Co holds in this procedure. The proof uses the mixing property of the conver- 
gence as described in Section 4. Moreover, under the alternative hypothesis, 
it is showed that Co is a specific mixture of normal distributions, and we 
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characterize its representation. A final discussion concludes the paper. To 
make it easy for the readers, proofs are located in the Appendix. 

2. Model specification and preliminary results. Let {(Ye(n), Y^{n)) : n > 
1} be a sequence of independent and identically distributed random response 
vectors with marginal distributions Cb and C\y, discrete or continuous on 
M. Consider an urn containing initially b black balls and w white balls, where 
b and w are two strictly positive real numbers. With the arrival of the first 
patient (n = 1), a ball is drawn at random from the urn and its color is ob- 
served. We define a random variable 8\ that we assume to be independent of 
the potential response vector (Ys(i), Yw(i)) for every i > 1 such that 8± = 1 
if the extracted ball is black, while 8\ = if the extracted ball is white. So 5\ 
is a Bernoulli random variable with parameter Zq = b/(b + w). After the ball 
is extracted, if it is black, it is replaced in the urn together with U(Yb(1)) 
black balls. Otherwise, if it is white, it is replaced in the urn together with 
U(Y\y(l)) white balls, where U is an arbitrary measurable function such that 
U(Yb(1)) and U(Yw(l)) have distribution on a nonnegative and bounded 
real set. In typical applications, U will be a monotone function. (Note that 
U may be chosen as the identity function when the distributions Cb and 
Cw have nonnegative and bounded support.) 

This process is then reiterated at every instant n + 1, n > 1. A ball is 
extracted and we define a random variable 5 n+ \ , indicating its color 5 n+ \ = 1 
if the ball extracted is black and <5 n +i = if the ball extracted is white. We 
always assume that 5 n +i is independent of the potential response vector 
(YB(i),Y\v(i)) for every i > n + 1. After the ball is extracted, it is replaced 
in the urn together with 

5 n+1 U(Y B (n + 1)) + (1 - S n+1 )U(Y w (n + 1)) 

balls of the same color. So, given the cr-algebra 

r n = a{8 1 ,5 1 Y B {V) 

(2.1) 

+ (1 - 8 l )Y w {\\ 5 ni 8 n Y B {n) + (1 - 8 n )Y w (n)), 
<5 n _l_i is Bernoulli distributed with parameter 

Z = — 

n B n + W n ' 

where 

n 

B n = b + J2SiU(Y B (i)), 
i=i 

n 

W n = w + J2(l-8i)U(Y w (i)). 

i=l 
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The RRU procedure drives the allocations {5 n }. When 8 n is 1, allocate the 
nth patient to the first treatment, say treatment B, and let the random 
variable 1b (n) be the potential response of nth patient to treatment B. 
When 5 n is 0, allocate the nth patient to the second treatment, say treatment 
W, and let Yw(n) be the potential response of nth patient to treatment W. 
Only one response for the nth patient will be actually observed, which we 
can write as Y(n) = 5 n YB(n) + (1 — 5 n )Yw(n). 

We thus generate the following processes: the sequence {5 n :n > 1} of 
Bernoulli random variables and the sequence {Z n :n > 0} of random vari- 
ables in [0,1], representing the proportion of black balls present in the 
urn at every stage. Now the number of black balls and white balls that 
have been drawn from the urn through the nth treatment allocation can 
be written as Ns(n) = Ya=x^i an d N\v{ n ) = X^LiU ~~ ^i)> respectively; 
clearly, Nb(ti) + N\y(n) = n. Also note that B n and W n are the cumula- 
tive (transformed) observed responses to treatment B and W, respectively, 
adjusted by the initial numbers of balls in the urn. We call B n and W n 
cumulative responses for short. 

Let tub = J U(y)CB(dy) and mw = J U(y)Cw(dy) be the means of the 
transformed responses. Then, from [5] and [22], we have the following limit 
for the proportion of black balls in the urn. 

Theorem 2.1. If ms > mw, then lim n ^ 00 Z n = 1, almost surely. 

Suppose now that the sequences of responses {1b(^)} and {Yw(n)} have 
finite means [J-b = J y£-B{dy) and [iw — / y£-w{dy) and that, for instance, 
the treatment B is preferred to the treatment W if fis > ^w- Then, choosing 
a function U such that fis > if an d only if tub > mw an d [Ib = fJ>w if 
and only if tub = mw, Theorem 2.1 ensures that a RRU -design allocates 
patients to the superior treatment with probability converging to one as n 
goes to infinity. 

Remark 1. As suggested in [23], the existence of a bounded function 
U such that 

j yC B {dy) > J yCw(dy)^ J U(y)£ B (dy) > J U(y)£ w (dy) 

and 

' y£ B (dy) = I y£ W (dy) O / U{y)C B (dy) = / U(y)£ w (dy) 



is guaranteed by the theory of utility. In the situation illustrated in this 
paper, the experimenter expresses a preference among the distributions of 
responses in terms of the ordering of their means. Indeed, he may also ex- 
press a different preference between Cb and Cw- The theory of utility gives 
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conditions which guarantee the existence of a bounded utility function U 
such that the expected utilities of the elements of a class of probability dis- 
tributions on R are ordered in the same way as a certain preference among 
the probability distributions (see, e.g., [7]). 

2.1. Preliminary results. The process {Z n :n > 0} of the proportions of 
black balls is of primary interest for the study of stochastic processes gen- 
erated by this particular generalization of Polya urn. Moreover, in a RRU- 
design, Z n represents the conditional probability of allocating the nth pa- 
tient to treatment B; the asymptotic behavior of this process is also essential 
for analyzing the asymptotic normality of estimators for these designs, as 
will be made clear in the next sections. [22] provides the following general 
result. 

Proposition 2.2. The sequence of proportions {Z n :n>0} is eventu- 
ally a bounded super or sub-martingale. Therefore, it converges almost surely 
to a random limit in [0, 1] . 

When the urn is reinforced by the random variables U (Yg(n)) and U(Y\y(n)) 
with means tub and mw such that tub > mwi then, as given by Theorem 
2.1, the limit is equal to 1 almost surely. 

Consider the case in which Cb = C-w so that the urn reinforcements 
U(YB(n)) and U{Yw{ n )) have the same distribution, say [i. Then, in [20], it 
is shown that P{Z 00 = x) = for every x G [0, 1] . However, the exact distri- 
bution of Zqo is unknown except in a few particular cases. When \x is a point 
mass at a non-negative real number m, the RRU degenerates to Polya's urn, 
in which case Z^ has a Beta(6/m, w/m) distribution. This is also the case 
for binary responses (success/failure) when m balls are added to the urn af- 
ter each success is obtained. In fact, [1] proves that only the nonnull part of 
the reinforcement distribution needs to be considered. For the general RRU 
with Cb = £>Wi [1] characterizes the distribution of Z^ as the unique contin- 
uous solution, satisfying some boundary conditions, of a specific functional 
equation in which the unknowns are distribution functions on [0, 1] . 

When me = m\y, it may happen that j U(y) k Cs{dy) / / U{y) k Cw{dy) 
for some k>2, and then U (Ye(n)) and U(Y\y(n)) may have different distri- 
butions. This is of particular interest because it corresponds to a situation 
in which the two treatments are considered equivalent. In Section 3 of this 
paper, we establish a fundamental property of Z^ when tub = myy, that is, 
P{Z (XJ = 1) = P(Zoo = 0) = in this case. 

The following preliminary result regarding the limiting sample sizes on B 
and W is important for showing the asymptotic normality of the common 
statistic for testing differences in mean responses. 
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Proposition 2.3. Nb(ti) and N\y(n) converge to infinity almost surely 
as n — > oo . 



Now, let 

r n = inf < k : (5j = n > and i/ n = inf < k — 5j) = n > . 

Thus, r n = j indicates that the nth observed response to treatment B occurs 
for patient j, and {Yb(t„,)} is the subsequence of the potential responses 
{Ye(n)} that are the observed responses to B. Similarly, v n = j indicates 
that the nth observed response to treatment W occurs for patient j, and 
the subsequence of potential responses {Yw (n)} to W that are the observed 
responses is {Yw(f n )}- [6] proves independence properties of the sequences 
of observed responses, so that the strong consistency of estimators based on 
those sequences can be deduced. 

PROPOSITION 2.4. The sequences {Y B (T n )} and {Y w (v n )} are i.i.d. with 
distributions Cb and Cw, respectively, and are independent one of each 
other. 



As a consequence of this proposition, we can model the observed responses 
of random sizes Ns(n) and Nw(n) to treatments B and W as samples 
from two i.i.d. populations generated by Lb and Cw, respectively. Assume 
that Cb and Cw depend on unknown parameters 9b and Ow- We have the 
following. 

Corollary 1. Suppose that Ob and 9w are estimators of 9b and 9b 
based on n-dimensional samples from two independent i.i.d. sequences gen- 
erated by Cb and Cw, respectively. Let 9b and 9w be the corresponding es- 
timators computed on observed responses through time n in the RRU -design 
[which have random sample sizes Nb(u) and Nw(n), resp.J. If 6b and 9w 
converge a.s. to 9b and 8w, respectively, then also 9b and 9w converge a.s. 
to 9b and 9w ■ 

3. Rates of convergence of AT B (n) and Nw(n). The aim of this section 
is to study the rate of convergence to infinity of the sample size sequences 
Ns(n) = Yn=l &i an d Nw( n ) = Ya=iO- — Si). We also obtain the rate of the 
convergence of the process Z n . Results for dichotomous responses can be 
found in [8]. First, we have the following. 

Proposition 3.1. 

f ou v n b( u ) 7 iv Nw ^ 1 7 

(6.1) hm = Zqo a.s. and Inn = 1 — Zoo a.s., 

n— >oo 77, n— >oo 77 
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where is the limit of the process {Z n } representing the proportion of 
black balls in the urn. 

This result doesn't provide the rate of convergence to infinity of Ns(n) 
and N\y(n), because the limits Z^o and 1 — may have a point mass at 0. 
The next theorem gives the exact rate of convergence to infinity of Ns(n) 
and N\v(n). 

Theorem 3.2. If tub > mw then: 

(i) lim n ^ +00 = 1, a.s.; 

(ii) there exist a random variable rj 2 with P(0 < rj 2 < od) = 1 such that 

N w (n) 2 
lim -, — = 77 a.s. 

rc-M-oo n m w /m B 

If m B = THW; then P(Z OQ = 0) = -P(Zoo = 1) = 0. Hence, the limits in 
Proposition 3.1 say that the rate of convergence to infinity of Nb(u) and 
Nw(n) is n, almost surely, in this case. 

Part (i) of Theorem 3.2 follows immediately from Theorem 2.1 and Propo- 
sition 3.1, since when > mw, then Z M = 1 almost surely and NB(n)/n 
converges to Z^. The rest of the theorem is a consequence of an auxiliary 
result concerning the relative convergence rates of the cumulative responses 
to treatments B and W , which holds both when tub = m\y and ms > mw- 

Theorem 3.3. B n /(Wn B/mw ) converges almost surely to a random 
variable ip with support in (0, oo). 

Finally, we obtain a relevant consequence of Theorem 3.2: when > 
m w i the exact order of convergence of the proportion of white balls in the 
urn is the same as was obtained for the proportion of patients allocated to 
treatment W. 

COROLLARY 2. Let n 2 be the random variable given by Theorem 3.2. If 
ms > mw, then 

,. 1 — Z n mw 2 

lim ; = 77 a.s. 

rw+oo n m w /m B -l mB 

Remark 2. There is a direct relationship among the random variable 
Zqo an d the limits in Theorem 3.2 and Theorem 3.3. When ms = mw, then 
Zoo = and 1 - Z - = ^r- When m B > m w , then n 2 = ^(^) m ^/ m \ 
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Remark 3. Theorem 3.3 has been inspired by some ideas contained in 
[25]. Actually, even when mg > mw, Theorem 3.3 doesn't follow from [25], 
Lemma 3.5, readapted to our case, which would assert that log W n / log B n — > 
mw/mB as re — > +oo, almost surely. As a counterexample, consider the case 
in which the rate of convergence of B n is n, while the rate of convergence 
of W n is n mw l mB log?i. (This is also a counterexample to the fact that their 
true Theorem 2.2 is not a consequence of [25], Lemma 3.5.) 

4. Asymptotic normality. Consider estimation of the means fi B and jiw 
of the responses to treatments. We define the following estimators based on 
the observed responses through patient re, with random sample sizes N B (n) 
and Nw(n), respectively: 

(4.1 Y B (n) = — — and Y w (n) = — — . 

N B (n) N w (n) 

Corollary 1 and the strong law of large numbers ensure that the (4.1) are 
strongly consistent. In this section, we show that these estimators, appropri- 
ately standardized, are jointly asymptotically normal, despite the random- 
ness of N B (n) and Nw(n), their dependence, and the fact that they don't 
satisfy the classical assumption, as also is required in [21], Theorem 3.2, that 
N B (n)/n and Nw(n)/n converge in probability to a constant in (0,1). 

Before proceeding, we need to recall the concept of mixing convergence, 
which was introduced by [26] and provides a particularly elegant approach 
to martingale central limit theory (see [10] and [11]). 

Definition 4.1. Consider a sequences of random vectors Y n = (Yi(n), 
. . . ,Y p (n)) on a probability space (Q,J-,P) that converges in distribution to 
Y = (Y\, . . . , Y p ). We say that the convergence is mixing if, for every point y 
of continuity for the distribution function of Y and for every event E £ 

UmP(F 1 (re) < Vl , .. ., Y p (n) < y p ,E) = P(Y 1 <y 1 , . . . ,Y p < y p )P{E). 

The mixing property of the convergence will be essential to our study 
of the asymptotic distribution of the test statistic in the following section. 
During this section and the next one, we will assume that the distributions 
of the responses C B and Cw have finite variances, which we indicate by as 2 
and o~w 2 . The next theorem establishes the joint asymptotic normality of 
estimators (4.1). 

Theorem 4.1. Either when fj> B = H-w or when \i B > [tw , 



f^(y4n)-MB),™^(%(n)-m))-^(0,I) (mixing). 
V a B aw J 
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5. Testing hypothesis. In this section we consider the hypothesis test on 
the mean responses 

Hq\^b = Pb versus H\:pB> Pw- 

Observe that, from Remark 1, these hypotheses are equivalent to Hq :?tib = 
m b versus H\ :mB> my/ . We characterize the classical statistic for two 
samples {Y#(n)} and {Yw(n)}, i.i.d. and with law Lb and Cw, respectively, 
when applied to the response-adaptive design that motivates this paper. 

First, consider the usual two-sample test statistic observed in a fixed 
design with sample sizes n B and nw, respectively, 



(5-1) Co 



Y B (n B ) - Y w (n w ) 



s B /n B + sfo/nw 



where Yb (n B ) and Yy/ (nyy) are the sample means, and s B and Sy^ are 
consistent estimators of the variances. Suppose that, for some p£ (0,1), 
n B /(n B + nw) — ► P, nw /{ n B + nw) —> 1 — P as n B and nw — > oo; then, 
from the classical central limit theorem, 

(5.2) (^l{Y B {n B )-p B ),^{Y w {n)-p w )) — * jV(0,I), 

V &B &W J 

we can deduce that Co converges in distribution to a standard normal random 
variable under the null hypothesis, while it is asymptotically normal with 
noncentrality parameter 

(5.3) 4> = - Ky/a- ^ - m 



VB 2 /n B + (Tu / /n w ^a B ' 2 /p + (Jw 2 /{l- p) 



bl- 
under the alternative hypothesis. 

Now, consider a response-adaptive procedure with random sample sizes 
such that N B (n)/n — ► p and Nw{n)/n — ► (1 — p), where p is a determined 
value in (0,1), even if unknown a priori. If the result (5.2), with N B (n) 
and N\y(n) replacing i%b and nyy, still holds, it can be deduced, similarly 
to the classical case and using Slutsky's theorem, that the test statistic Co 
[with Nb(ti) and Ny/{n) replacing ns and nw] still has asymptotic normal 
distribution, and the same asymptotic noncentrality parameter under the 
alternative hypothesis. A discussion on the power of the test and the sample 
size calculation in this context can be found in [13]. 

Examine now the -R-RL'-design considered in this paper. Under the null 
hypothesis, Nsinj/n and Nw(n)/n converge to random limits and 
(1 — Zoo), respectively. So, we can't derive the asymptotic normality of the 
test statistic as in the classical case; we need the mixing property of the 
convergence proved in Theorem 4.1. Also, under the alternative hypothesis 
we find a different situation: ATg(n)/n — > 1 and Nw(n)/n — ► 0, almost surely. 
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This implies that, asymptotically, Co( n ) carries information about the value 
of the mean of one only treatment. Notwithstanding this loss of balance, the 
use of the test statistic Co( n ) is still reasonable because the rates of N B (n) 
and N\y(n) carry information about the difference between [i B and nw 
Let us define 

n 

CT 2 B (n)=Y / ^i(yB(i)-Y B (n)) 2 /N B (n) 
i=l 

and 

n 

*w(ri) = E(l " Si)(Y w (i) - Y B (n)) 2 /N w (n), 

i=l 

which are strong consistent estimators of the variances a B and afy from 
Corollary 1. 

Theorem 5.1. The random process 

(5.4) gn) = f j n) ~ fw{n) ~ (MB ~ <* w) N(0, 1) (mixing), 
^o- 2 B /N B (n)+a^/N w (n) 

both under the null and under the alternative hypothesis. Moreover, the same 
result holds with a B (n) and <rjy(n) in (5.4), instead of a B and a^y. 

Let us examine the distribution of the test statistic 

Y B (n) -Y w (n) 

Co{n) = . 

^Ja 2 B (n)/N B (n) + a 2 w {n)/N w (n) 

When Hq is true, Co( n ) is asymptotically normal from Theorem 5.1. So, 
one can then construct the following critical region with asymptotic level of 
significance a: 

C a = {CoW > Zi- a }. 

Let 77 be the positive square root of rj 2 . The following corollary establishes 
that, when the alternative hypothesis is true, the test statistic Co( n ) is a 
mixture of normal distributions and characterizes its representation. 

Corollary 3. Under the alternative hypothesis, the conditional distri- 
bution of Co( n )> given the random variable rj 2 defined in Theorem 3.2, is 
asymptotically normal with mean equal to V n mw / mB r\ flB ~ l ^ w and unit vari- 
ance. 

Remark 4. Simulations on the mixing distribution n are provided in 
[19] and [15]. In particular, it is shown that 77 is not a point mass, and this 
implies, by Corollary 3, that C( n ) is not asymptotically normal under the 
alternative hypothesis. 



12 



C. MAY AND N. FLOURNOY 



6. Discussion. In Section 3, we have proved that the rate of convergence 
of the number of patients assigned to the worst treatment is determined by 
the ratio of mw and mg: the smaller the value of this quantity, the more 
slowly Nyy(n) increases. However, for smaller values of mw/mB, the prob- 
ability of assigning patients to the best treatment converges to one faster. 
On the other hand, the study of the noncentrality parameter in Section 5 
shows that small values of mw/ m B cause a loss of the power of the test 
for treatment mean differences. This observation generates open research 
questions regarding the best choice for the function U that determines the 
values of tub and mw- 

The theoretical distribution of the test statistic £o for mean responses 
comparison has been studied in Section 5: Co is asymptotically normal un- 
der the null hypothesis, while it is a specific mixture of normal distributions 
when the alternative hypothesis holds true. Hence, the law of the mixing 
distribution rj is of fundamental importance for calculating the exact power 
of the test. The distribution of r\ is an open problem; some of its proper- 
ties are investigated by simulation in [19] and [15]. Without knowing the 
exact distribution of r], the power of the test statistic must be evaluated by 
simulation. 

Simulated results concerning the performance of an i^i^L^-design are pro- 
vided in [24] and [23] (for normal responses), and in [15] (for binary re- 
sponses). The number of treatment failures to achieve a determined power 
is compared with a default design. Simulations show that an i?i?[/-design 
is a viable alternative when the difference between the mean responses to 
treatments has moderate to large values. The -Ri?C/-design is also appealing 
with binary responses when success probabilities are small. 

This work needs to be extended to the K-treatment problem. In [8] and 
[22] limiting results for the proportion of balls of each type generated by the 
RRU in the K-treatment problem has been obtained. But they did not obtain 
associated distribution theory for comparing treatment means and dealing 
with the associated multiple comparisons. We have laid a foundation for 
such results in this paper, but we believe the required extension will contain 
technical difficulties, and so we leave it now as an open problem. 

In the present work, we have solved some asymptotic theory problems 
generated by the fact that an RRU-design has a very desirable property 
that can't be approached with the usual methods presented in literature. 
We wish that this study may offer a contribution to the development of 
research in response-adaptive, optimal designs. 
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APPENDIX: PROOFS 

Proof of Proposition 2.3. Set (3 such that P(U(Y W (1)) </3) = l. 
Following [22], Theorem 2, if r = inf{n > 1 : 5 n = 0}, then, for k > 1, 

b b + (3 b+(k-l)P 



P(t >k)< 



b + wb + w + (3 b + w + (k-l)/3 

/fc-i 
exp [J2 lo § ( 



b + nfl 
Vn=0 yb + w + nP 

Since J2^=o log[(6 + nfS) /{b + w + nf3)\ = — oo, it follows that lim^oo P(t > 
k) = 0, and hence P(t < oo) = 1. From the strong Markov property we 
obtain that 

P(5 n = 0, i.o. ) = 1, 
and then N\y(n) — > oo, a.s. The proof for iYe(n) is similar. □ 

PROOF of Proposition 3.1. Since E(^|^i_i) = and (from Propo- 
sition 2.3) Ya=\ $i ~^ °°> almost surely, it follows from Levy's extension of 
the Borel-Cantelli lemma that, almost surely, 

^]Zi_i^oo and ' =1 ► 1. 

Because Xm\ n ^ 00 Z n = Z^, almost surely, (X)E=i ^i-i)/ ri converges to 
by Cesaro's lemma. Hence, (Sr=i <^)/ n converges almost surely to Z^. □ 

In order to prove Theorem 3.3, we need the following two lemmas. 

Lemma A.l. If tub > m\y , then: 

(i) the rate of convergence of B n is n, almost surely; 

(ii) the rate of convergence of W n is greater then n 1 for some real con- 
stant 7 > 0, almost surely. 

If m B = TUw i then: 

(iii) the rate of convergence of B n + W n is n, almost surely; 

(iv) the rates of convergence of both B n and W n are greater than n 1 , 
almost surely, for some real constant 7 > 0. 

Proof, (i) This is an immediate consequence of Corollary 1 and of the 
fact that when tub > rn\y then Nb(ji)/ti — > 1 (using Proposition 3.1 and 
Theorem 2.1). 

n— >+oo n n—>+oo n n 

= hm — I = m B a.s. 

n— >+oo n JSB\n) n 
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(ii) Let's consider the conditional increments of the process B n /W^, for 
some k > m B /mw- 



E 



B 



n+l 



n+l 



B n 



B n 



B n + W n 



-E 



B n + U(Y B (n + l)) 



Bn 



•F n 



+ 



w„ 



-E 



Br, 



Bn + Wn \{W n + U{Y W (n+l))Y 

B n E(U(Y B )) 



B n + W n 



+ 



W n B n 



■E 



1 



Bn 



1 



Bn + W ri \{W n + U(Y W {n+l))Y 

By a Taylor expansion of the function f(x) = \/{a + x) K with x = U(Yw(n + 
1)) and a = W n , we can choose a constant c such that whenever W n > 1 



E 



1 



XW n + U{Y w (n + l))Y 
Thus, we obtain that 

Bn+i B n 



1 " W" 



W, 



T-nV{u(Y w)) - — 



E 



(A.l) 



W K , i 



Tn 



< 



Br, 



1 



B n + W n W- 



E(U(Y B )) - kE(U{Y w )) + 



KC 
Wn 



Prom inequality (A.l), note that if k > E(U(Y B ))/E(U(Y W )), then the pro- 
cess B n /W£ is (eventually) a positive supermartingale, and then it con- 
verges, almost surely, to a finite limit. Since from part (i) of the lemma 
B n /n — > m B a.s., it follows that also n/W£ converges almost surely to a 
finite limit. Hence, for every e > 0, n/W£ +e converges a.s. to 0. This means 
that W£ +e > n eventually, that is, W n > n l /^ K+e ^ a.s., eventually. 

(hi) If m B = m\y, then lim n ^ +00 Bn ^ n =m B , almost surely, because 

b + w , E?=iSiU(Y B (i)) 



B n + W n 



lim 



+ 



n 



»+oo n n 
Y.U{l-5i)U{Y w (i)) 



n 



Um b + w | T,i=iSiU(Y B (i)) N B (n) 



+ 



N B (n) n 
E?=i(l " $i)U(Y w (i)) N w (n) 



■\n 



N w (n) 



n 
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which, by Corollary 1 and Proposition 3.1, converges almost surely to mBZ 00 + 
mw(l — Zoo). Since mg = raw-, this is equal to mg. 

(iv) From the part (iii) of the lemma, it follows that, eventually, B n + 
W n > n ■ tub ■ 3/4 on a set of probability one. Let Fb '■= {B n > n ■ mg/4, 
eventually} and F w := {W n > n ■ mg/4, eventually}. By Proposition 2.2, Z n 
converges a.s. in [0,1]. Then, the ratio B n /W n converges a.s. in [0,+oo] 
and, therefore, P(Fb U Fw) = 1. In fact, on [Fb U Fw) c , we have that 
liminf n fi n /n < mg/4, limmi n W n /n < mg/4 together with i? n + W n > 
n • tub ■ 3/4 that implies liminf n B n /W n < 1/2 < 2 < limsup n B n /W n , that 
is, Z n does not converges on (Fb U i 7 V) c - Hence, P(Fb U i 7 iv) c = 0. What 
remains to prove is that, on Fb (resp. on .Fw), W n (resp. B n ) is eventually 
greater than n 7 , for some real constant 7 > 0. 

Let us focus on what happens on the set Fb- Using the same argument 
as in the proof of part (ii) of this lemma, it follows from equation (A.l) that 
for k > 1 the process B n /W^ is (eventually) a positive supermartingale, and 
then it converges almost surely to a finite limit. This implies that, for every 
e > 0, B n /W^ +e converges to and then 

(A.2) B n /W* +£ <1 on F B , eventually. 

Since, on Fb, B n > n ■ mg/4 eventually, then, by (A.2), we deduce that, 
on F B , n/W n K+£ • m B /4 < 1, a.s., that is, W n > n l ^ K+£ ^ ■ {m B /A) l ^ K+£ \ 
eventually. □ 



The second lemma is a general fact about convergence of random se- 
quences; for lack of a better reference, see [25], Lemma 3.2. 

Lemma A.2. Let {£ n :n > 0} be a random sequence that is measurable 
with respect to a filtration {^F n }. Define 

= E(£ n+ i — Cnl-^n); 
Qn = E((£ n+ i — £n) 2 |-Fn)- 

IfJ2n^n < +00 and J2nQn < +°o on a set of probability one, then £ n con- 
verges to a finite random variable, almost surely, as n goes to infinity. 



Proof of Theorem 3.3. We apply Lemma A.2 to the process 

Bn 

^ = log 



w rn B /m w 



in order to prove that it converges almost surely to a finite random vari- 
able. This implies that B n /(W n nB ^ mw ) converges almost surely to a strictly 
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positive and finite random variable. 
An = E(£ n+ i — £„|jF n ) 



ra B 



E(log£ n+ i - logB n \F n ) ^E(log W n+ i - log W n \ T, 



mw 



B n 



B n + W,_ 

ra B W n 



ra w B n + W n 



E(log(£ n + U(Y B (n + 1))) - logfl B |.F n ) 
E(log(W n + U(Y w (n + 1))) - log 



B n 



-E 



U(Y B (n+l)) i 



B n + W n \Jo 
m B W n 



dt 



-E 



raw B n + W n \Jo 



B ri + r 

U(Y w {n+l)) l 



■dt 



W n + t 

By a Taylor expansion of the function f(x) = l/(x + 1), it follows that, 
for B n sufficiently large, there exist constants c\ and such that, for every 



(A.3) 



B 



1 t 1 1 

C2Tv7 < < 



Bn B n + t B n Bl 



+ ci 



and, similarly, for l/(W n + t). Hence, it follows that 



(A.4) 



1 



B n + Wn\B n Wl 



h-\ ho \ 

-J- + _L ) < A„ < 



~B~3 



ki . k? 



B n + W n \Bl W n , 

for some constant k±,k2,hi,h2- Thanks to the rates of convergence of B n 
and W n shown in Lemma A.l, we obtain that J2 n A n < +oo, a.s. 

Qn = E((£ n+ i - in) 2 \Fn) 

B ^—E((log(B n + U(Y B (n + 1))) - log B n f \ F n ) 



Bn + W n 

w n 



+ 



B n + W n 
B 



E((log(W„ + U(Y w (n + 1))) - log W„) Vn) 



B n + W n 



-E 



U(Y B (n+l)) i 



■dt 



+ 



Wn 



ra B 



mw ) B n + W n 



-E 



B n + t 

U{Y w (n+l)) i 



W n + t 



dt 



Since, for positive t, B 1 +t < -g- and w l + f < we obt 



W n +t - W n ■ 



am 



B n (E(U{Y B Y)\ , fm B \ 2 W n (E{U{Y W 



Qn < E I - 

^ - B n + W n \ B r , 



+ 



\m w ) B n + W n { W n 2 
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hence, J2nQn < +00, a.s. □ 

Proof of Theorem 3.2. Let m B > mw- Observe that 



n->+oo ^B/mw n->+oo + £^ =1 (1 - <5j)C/(Yiy (j)))^/^ 

ZU$iU(YB(i))/N B (n) 



— lim 

x jMn) 
N w {n) mB/mw ' 

From Theorem 3.3, we know that this limit is equal to ij) € (0,oo). From 
Corollary 1, we have that Ya=i 5iU(Y B (i))/N B (n) and E"=i(!- WOMO)/ 
Ny/{n) converge a.s. to mg and mw, respectively, and N B (n)/n converges 
a.s. to 1. So, it follows that N\y(n) mB ^ mw /n converges a.s. to a random 
variable in (0, oo). Hence, we obtain part (ii) of the theorem. 

Let m B = m\y. From Proposition 3.1, we know that, almost surely, N B (n)/ 
n — > Zqo and Nw(n)/n — > 1 — Z^. Since, from Theorem 3.3, B n /W n con- 
verges a.s. to ip £ (0,oo), it follows that 

B n B n /W n 
n B n + W n B n /W n + 1 

converges a.s. to ip/(ip + 1), which is then in (0, 1). Hence, 

(A.5) p(Z oo = 0)=P(Z oo = l) = 0. □ 

Proof of Corollary 2. 

1-Z n 1 W n 



n m w /m B -l n m w /m B -l Q n _|_ ^ 

1 w + Y,U(l-W(Yw(i)) 



n m w /m B -i h + w + E n =i 5iU(Y B (i)) + E?=i(l - Si)U{Y W (i)) 
1 



^.myy /iub — 1 

x ( w + (^(l-Si)U{Y w (i))/N w {n)^N w {n) 



b + w+\^6iU(Y B (i))/N B (n)jN B (n) 
+ (f2(l-8i)U(Y w (i))/N w (n))N w (n) 



\i=i 
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= (w/n m ™l m * + f> - SiMYwW/Nwin) • N w (n)/ 

x l(b + w)/n + J25 l U(Y B (i))/N B (n)-N B (n)/ 



,, n 
i=i 

n \ -1 

5^(1 - 5i)U(Y w (i))/N w (n) ■ N w {n)/n . 
i=l / 



Now, using Corollary 1 and Theorem 3.2, the numerator of the last equality 
converges almost surely to rj 2 mw, while the denominator converges almost 
surely to m^. □ 

In order to prove Theorem 4.1, we need the following lemma, which ex- 
tends [10], Corollary 3.2, to the multi-dimensional case. It can be obtained 
using the Cramer- Wold device as in [11], Theorem 12.6. 

Lemma A. 3. Let {X n j = (X n i , . . . ,X n i p ), J- n i, 1 <i< k n } be a zero- 
mean, square-integrable, p- dimensional martingale array differences. Sup- 
pose that 

^E[||X m || 2 /(||X ni || > e)|^ n>i _i] - P for all e > 0, 

i 

and 

S n := ^ E[(X n j)'X n j| T n,£-i] — > F 5j. 

i 

IfS is a positive definite matrix with probability 1, then 
]T X^S" 1 / 2 -> d AA(0, 1) (mixing). 

i 

Proof of Theorem 4.1. Let k n = n and T n % = o-{5\,5iY B (l) + (1 — 
<5i)>V(l), S i3 SiY B (i) + (1 - 5i)Y w (i),5 i+1 ). 

SinceE(^(y B (z)-/i B )|^ n) i_i) = (5 i -E(y B (z)-/i B ) = 0andE((l-^)(iV(0- 
fi>w)\Fn,,i-l) = -E(lV(i) -//vy) =0, it follows that {X ni , F ni , 1 <i < 

fc n }, with 

is a 2-dimensional martingale array differences. X n j satisfies 
^EOlXnif/dlX^H^e)!^!] 
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n m w /m B 



N B (n) 



n 



x I(\Y w (i) - n w \ > ey/n^/mB)^.^ 
E((Yb(1) - Vb?I(\Y b (1) -hb\> ey/n)) 



+ 



N w (n) 



n m w /m B 

Moreover, it can be verified that 

= ^£ E[(X n j)'X n j|jF n.j_i] 



E((y w (l) - W ) 2 I(|y w (l) - ml > eA m «'/™ S )) 0. 



( N B {n)a 2 B 



\ 



n 






N w {n)a w 

n m w /m B J 



and, by Theorem 3.2, S n converges almost surely to a positive definite ma- 
trix I] with probability 1 , either when m B = rnw and when > mw , that 
is, when fj, B = fJ-w and /i# > We can then apply Lemma A. 3 to X n j, 
and we obtain the theorem. □ 



Proof of Theorem 5.1. Let 

a%N w (n)/n 



a%N w {n)/n + a^N B {n)/n 



Then, by Proposition 3.1, 



almost surely, and 



C(n) = ■ vEeM (Y B (n) - (M B ) + yT^K 



&B 



VN w {n) 
aw 



(Y w (n) - fiw) 



VNB(n) 

°B 



(Y B (n) - fis) + v/T^A^ • VNw{n) (Y w (n) - m ) 



aw 



&B 



aw 
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= V Aoo \ Y B(n) - hb) + V 1 - Aoo (Y w {n) - fi W ) 

<T B &w 

+ (1). 

According to Theorem 4.1, it follows that C( n ) converges mixing in distri- 
bution to a random variable having a characteristic function as 



E(exp(-iA 00 t 2 ) • exp(-i(l - A^ 2 )) = e 



-i 2 /2 



which is the characteristic function of a standard normal random variable. 
For the case of unknown variances, define 



7(ra) 



a%N w (n)/n + afoN B (n)/n 



&%(n)N w (n) /n + a^{n)N B {n) /n 
which converges almost surely to 1, by Theorem 3.2. By noting that 
Y B (n) - Y w (n) - (ji B - fi w ) 



^a 2 B (n)/N B (n) + a^(n)/N w (n) 



C(n) -7(n), 



from the mixing convergence of ((n) an d Slutsky's theorem, we obtain the 
thesis. □ 



Proof of Corollary 3. When \i B > /j, w , we have that (o(n) = ((n) + 
(n), where 

4>(n) 



^Ja 2 B (n)/N B (n) + a 2 w (n)/N w {n) 

is the noncentrality parameter. Since, for Theorem 3.2, N B (n)/n mw 'l mB — ► 
oo and Ny/{n)/n mw 'l mB —>, rj 2 almost surely, then, for n large, we can ap- 
proximate the noncentrality parameter with 

<j)(n) 7x \/ n m w l m B Jtf— — — . 



For the mixing property of the convergence in Theorem 5.1, C( n ) i s asymp- 
totically independent of rj 2 , and then we obtain the thesis. □ 
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